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Abstract 

We report results of a new global next-to-leading order fit of parton distribution functions in 
which cuts on W and Q are relaxed, thereby including more data at high values of x. Effects of 
target mass corrections (TMCs), higher twist contributions, and nuclear corrections for deuterium 
data are significant in the large- x region. The leading twist parton distributions are found to 
be stable to TMC model variations as long as higher twist contributions are also included. The 
behavior of the d quark as x — > 1 is particularly sensitive to the deuterium corrections, and using 
realistic nuclear smearing models the d-quark distribution at large x is found to be softer than in 
previous fits performed with more restrictive cuts. 
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I. INTRODUCTION 



The distribution of up and down quarks in the proton is one of the most fundamental 
characterizations of the structure of the ground state of QCD. Several decades of accumu- 
lated data from a variety of hard scattering processes, together with sophisticated, next-to- 
leading order (NLO) QCD analyses, has produced a detailed mapping of the proton's parton 
distribution functions (PDFs) over a large range of kinematics [1-4] . 

A dominant role in this endeavor has been played by lepton-nucleon deep inelastic scat- 
tering (DIS) data, which provide direct information on the behavior of the quark PDFs, 
as well as on the gluon distribution via the observed logarithmic scaling violations and the 
imposition of the momentum sum rule on the PDFs. However, the kinematics of many deep 
inelastic experiments limits the coverage in Bjorken x, so that our knowledge of PDFs does 
not extend uniformly over the entire x-range, especially so at large x. The invariant mass 
squared of the produced hadronic system is given by 



where M denotes the mass of the target nucleon and Q 2 the photon virtuality. As x — > 1 at 
fixed Q 2 , W approaches values in the nucleon resonance region. This region may be treated 
using the concept of quark-hadron duality [5], but this is beyond the scope of the present 
analysis. In order to simultaneously stay in the region of large W and access high values of 
x, one must therefore go to increased values of Q 2 ; in practice, though, this is limited by 
the available beam energy. Finally, the cross section falls rapidly as x and Q 2 increase, so 
the data samples can become statistics-limited in this region. 

Knowledge of PDFs at large x, defined here to be x > 0.5, is important for a number 
of reasons. Apart from its intrinsic value in providing a laboratory for studying the flavor 
and spin dynamics of quarks, the large- a; region is unique in allowing perturbative QCD 
predictions to be made for the x dependence of PDFs in the limit x — > 1 [6]. Furthermore, the 
d/u quark ratio at large x is very sensitive to different mechanisms of spin-flavor symmetry 
breaking in the nucleon [7] . Reliable knowledge of PDFs at large x may also be important 
for searches of new physics signals in collider experiments, where uncertainties in PDFs at 
large x and low Q 2 percolate through Q 2 evolution to affect cross sections at smaller x and 
larger Q 2 [8] . This is especially true if the search involves a region where the rapidity is large, 
where one is sensitive to products of PDFs evaluated with one value of x being small and 
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the other large. An example is provided by the planned PHENIX measurements of polarized 
gluon distributions, Ag, at small x by detecting VF-bosons at large rapidity: since Ag is 
convoluted with large-x unpolarized quark distributions the precision of the measurements 
will depend on the precision to which the quark PDFs are known at large x. 

Nuclear PDFs at large x are also important in the analysis of neutrino oscillation ex- 
periments such as T2K [9], NOz/A [10], and DUSEL [11]. A large part of the theoretical 
uncertainty is due to lack of precise knowledge of the neutrino-nucleus interaction in the 
kinematics between the DIS and resonance regions, as well as in the implementation of the 
non-isoscalarity correction. Better control of nuclear corrections at large x and a precise 
knowledge of the d/u ratio will have a direct and measurable impact on the interpretation 
of such experiments. This analysis is also timely given the increasing base of large-x mea- 
surements available, including, among others, DIS data from Jefferson Lab, and Drell-Yan 
and VF-asymmetry data from Fermilab. 

In order to improve our knowledge of PDFs in the large- a; region, and to thereby reduce 
their errors, in this work we expand the coverage in x provided by the deep inelastic data by 
relaxing the W > 3.5 GeV and Q > 2 GeV cuts traditionally used to limit the theoretical 
analysis to leading twist. This requires the development of the methodology for treating 
1/Q 2 suppressed terms such as target mass corrections and higher twist contributions. Fur- 
thermore, as one probes deeper into the large- a; region, nuclear corrections must be included 
when dealing with nuclear targets, as these become increasingly important as x — > 1. 

It is a priori not obvious that the uncertainties on the PDFs will be reduced by relaxing 
the cuts to include more data, since the various new theoretical contributions will have 
uncertainties of their own which could increase the resulting uncertainties on the extracted 
PDFs. It is the purpose of this paper to explore precisely this question. While some previous 
attempts have been made in the literature to include lower-Q and \ower-W data in global 
fits and to explore the related uncertainties [12-14], in this work we specifically focus on the 
large-x region. 

The outline of the paper is as follows. Section II summarizes the theoretical issues which 
must be addressed in the region of large x, while Sec. Ill addresses the choice of data sets 
and the procedures utilized in the fitting. In Sec. IV the results of the fits are presented 
and discussed, and Sec. V describes the resulting d/u ratio. Our conclusions are presented 
in Sec. VI. In keeping with the convention adopted in previous CTEQ fits [1, 15], we shall 
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refer to the new fit from this analysis as "CTEQ6X." 

II. THEORETICAL ISSUES AT LARGE x 

From a theoretical standpoint the large- a; region requires the inclusion not only of leading 
twist PDFs, but also of contributions suppressed by at least one power of Q 2 , which include 
target mass corrections and higher twist corrections. The former are kinematic in origin and 
involve terms suppressed by powers of M 2 /Q 2 ; the latter are dynamical in origin and are 
suppressed as powers of A 2 /Q 2 at large Q 2 , with A measuring the scale of nonperturbative 
parton-parton correlations. Furthermore, if a nuclear target is used, one must include nuclear 
corrections which account for the difference between the PDFs in free nucleons and those in 
nucleons bound in a nucleus. These corrections are not suppressed at high Q 2 , and are most 
significant at large x. Historically, to limit the effects of the target mass and higher twist 
corrections it has been customary to restrict the data included in the fits to large values of 
Q and W. In this section we discuss the implications of these cuts and the theoretical tools 
needed to relax them. 

A. Kinematic cuts 

To avoid theoretical complications in PDF analyses at large x it has been common to limit 
the deep inelastic data used to those points which satisfy Q > 2 GeV and W > 3.5 GeV. 
These combined cuts have the effect of severely limiting the number of DIS data points in 
the large- a; region that may be used to constrain PDFs in a global fit. However, as discussed 
in Refs. [16, 17], they do not completely eliminate the need for 0(1/ Q 2 ) power corrections, 
whose neglect is a source of tension between several DIS data sets. 

Other hard scattering processes, such as the production of lepton pairs, vector bosons, 
high-p T jets, and direct photons, involve a large momentum scale analogous to Q 2 in deep 
inelastic scattering. The values are typically sufficiently large that one avoids target mass 
corrections and higher twist contributions. However, the relevant range in x is typically set 
by x T = 2p T /y/s for high-pr scattering or by Mi+i- / 'y/s for the production of a lepton pair 
of mass Mi+i-, and data for such processes rarely constrain the large-x behavior of PDFs 
(although some exceptions to this will be discussed below). 
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Given all of the restrictions described above, one might well wonder how it is that any 
constraints at all are provided on the possible behavior of PDFs at large x. The answer is 
that there are indirect constraints which are provided by sum rules and the Q 2 evolution 
of the PDFs. The momentum and number sum rules provide some constraints since they 
involve integrations over the entire range of x. However, since the PDFs fall off as powers of 
(1 — x), the large- a; region contributes only small amounts to these sum rules. In addition, 
the DGLAP Q 2 evolution equations, used to calculate the PDFs at values of Q 2 above that 
where the initial PDFs are parametrized, have the property that the PDFs at high values of 
x and low Q 2 feed the behavior at lower values of x and higher Q 2 . Hence, there are indirect 
constraints placed on the large-rr PDF behavior by data in regions of lower x and higher Q 2 . 

It is important to understand that when one uses a supplied parametrization of PDFs in 
the large- a; region the results represent extrapolations of the PDFs into regions where they 
have not been directly constrained by data. In order to reduce the size of the unconstrained 
region and check the validity of the large- a; extrapolations obtained in standard global fits, 
one needs to account for the neglected theoretical corrections, which we discuss below. 

B. Nuclear corrections 

To probe the x dependence of the d quark to the same level of accuracy attained for the 
u quark requires lepton scattering data from a neutron target. Since free neutron targets 
do not exist, the next best choice is to utilize a deuterium target. Because the deuteron 
is a very weakly bound nucleus, many analyses which include proton and deuteron data 
make the assumption that it can be treated as a sum of a free proton and neutron. On the 
other hand, it has long been known from experiments on a range of nuclei that a nontrivial 
x dependence exists for ratios of nuclear to deuteron F2 structure functions. These effects 
include nuclear shadowing at small values of x, ant i- shadowing at intermediate x values, 
x ~ 0.1, a reduction in the structure function ratio below unity for 0.3 < x < 0.7, known 
as the European Muon Collaboration (EMC) effect, and a rapid rise as x — > 1 due to 
Fermi motion. Moreover, deviations from unity of the ratio of deuteron to proton structure 
functions corrected for non-isoscalarity, as well as many theoretical studies, strongly suggest 
the presence of nuclear effects also in the deuteron. 

In the absence of experiments with free neutron targets, the size of the nuclear effects in 
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the deuteron has yet to be determined empirically, although expectations are that it should 
be somewhat smaller than for heavy nuclei on the basis of its much lower binding energy. 
Nevertheless, in the region x > 0.5 the effects of nuclear Fermi motion lead to a rapidly 
increasing ratio of deuteron to nucleon structure function, which diverg 1. Any 

high-precision analysis of the large- a; region must therefore account for the nuclear effects if 
data on deuterium (or other nuclei) are used in the fit. 

The conventional approach to describing nuclear structure functions in the intermediate- 
and large- a; regions is the nuclear impulse approximation, in which the virtual photon scat- 
ters incoherently from the individual bound nucleons in the nucleus. There have been 
many formulations of nuclear structure functions in the literature within this framework 
(see e.g. Refs. [18-21]). In this analysis we adopt the weak binding approximation (WBA) 
of Refs. [22-25], where a covariant framework is used to relate the nucleon and deuteron 
scattering amplitudes, which are systematically expanded in powers of p/M, with p the 
bound nucleon momentum in the deuteron. The deuteron structure function can then be 
written as a convolution of the bound nucleon structure function and the distribution of 
nucleons in the nucleus. Because the struck nucleon is off its mass shell with virtuality 
p 2 = Po — p 2 < M 2 , its structure function can in principle depend on p 2 , in addition to x 



Since the deuteron is weakly bound, with binding energy e D = —2.2 MeV and average 
nucleon momenta of the order |p| ~ 130 MeV, the typical nucleon virtuality will be ~ 4% 
smaller than the free nucleon mass, M. One can therefore approximate the bound nucleon 
structure function by its on-shell value, in which case the deuteron structure function can 
be written, to order p 2 /M 2 , as [24-26] 



Here F 2 JV is the nucleon (proton p or neutron n) structure function, and f^/d gives the 
light-cone momentum distribution of nucleons in the deuteron. The scaling variable y = 
(Md/M)(p ■ q/pd ■ q) is the deuteron's momentum fraction carried by the struck nucleon, 
where q is the virtual photon momentum, and pd and M d are the deuteron four-momentum 
and mass. The maximum value of y, to order p 2 /M 2 , is given by y max = 1 + 7 2 /2 + e<i/M, 
so that in the Bjorken limit y max — > 1.5 + e^/M. Relativistically the upper limit on y 
is given by M d /M, which suggests that the nonrelativistic approximation breaks down at 



and Q 2 . 





6 



large y. A careful treatment of relativistic corrections will be important as x > 1; however, 
for the x range covered in this analysis a nonrelativistic treatment is sufficient. In the 
Bjorken limit the nucleon distribution function f N / d (also called the "smearing function") 
is a function of y only and is limited to y < M^/M. At finite Q 2 , however, it depends in 



addition on 7 = y 1 + Ax 2 M 2 / Q 2 , which in the nucleus rest frame coincides with the virtual 
photon "velocity" \q\/qo, with significant consequences when fitting large-x deuterium data 
[25]. Furthermore, at finite Q 2 the limits of integration in y in general become x and Q 2 
dependent, although their effects are negligible for x < 0.85 [24, 26]. 

The function f^/d is computed from the deuteron wave function, and accounts for the 
effects of Fermi motion and binding energy of the nucleons in the nucleus, as well as kine- 
matic 1/Q 2 corrections [24-26]. In our numerical calculations we use several different non- 
relativistic deuteron wave functions, based on the Paris [27], AVI 8 [28] and CD-Bonn [29] 
nucleon-nucleon potentials, but find that the differences are small for x < 0.85. 

Including explicit off-shell dependence in the bound nucleon structure function leads to 
a two-dimensional convolution in terms of y and the nucleon virtuality p 2 . For small val- 
ues of \p 2 — M 2 \/M 2 <C 1 the off-shell effects can be incorporated into a generalization of 
Eq. (2) in which the nucleon distribution function depends explicitly on x, in addition to 
y and 7. However, at larger x higher-order corrections in p/M become increasingly impor- 
tant, and the factorization of the nucleon and nuclear functions inherent in the convolution 
approximation is expected to break down [30, 31]. In this case relativistic effects can be 
incorporated through an additive correction, F 2 d — > F% + 5 (off )F 2 d , which arises from explicit 
p 2 dependence in the quark-nucleon correlation functions and relativistic P-state compo- 
nents of the deuteron wave function. The latter have been computed, for instance, within 
the relativistic spectator theory for the deuteron [32]. 

The off-shell dependence of the bound nucleon structure functions computed in Ref. [30] 
has been estimated within a simple quark-spectator model [31] with the parameters fitted 
to proton and deuteron F 2 data, and leads to a reduction of about 2% of F% compared to 
the on-shell approximation. A simple parametrization of the relativistic and off-shell effects 
in the model of Ref. [30], relative to the total F 2 d , is given by 

r(off) pd 

— ^ = a (1 + a lX ^){\ - [a 3 - x a ^) , (3) 
with at = {-0.014, 3, 20, 1.067, 1.5, 18} at a scale of Q 2 « 5 GeV 2 . Note that this correction 



is applicable only for x > 0.2, and for the values of x relevant to the current analysis one 
finds 5^ F$ / F$ < 1.5%. Furthermore, because the off-shell nucleon-deuteron amplitude 
was evaluated in Ref. [30] using wave functions derived from a pseudoscalar nN interaction 
[32], which is known to produce large P-state contributions, the correction (3) is likely to 
provide an upper limit on the size of the relativistic, convolution-breaking effects. 

Finally, we note that while a few global PDF analyses [12-14, 33] have incorporated 
nuclear effects in the deuteron using smearing functions similar to those employed here, 
some studies [34] have corrected deuteron data using a parametrization [35] of the nuclear 
density model [36] . This model is motivated by the observation that for heavy nuclei the ratio 
of nuclear to deuterium structure functions scales with the nuclear matter density [21, 36]. 
Assuming the density scaling extends all the way down to the deuteron, one can obtain the 
ratio of deuteron to nucleon structure functions from the ratio of iron to deuterium structure 
functions, F 2 d /F 2 JV 1+0.25 (F 2 Fe /F 2 rf — 1), using the empirical nuclear density for 56 Fe and an 
ansatz for the charge density of deuterium [36, 37]. Consequently the F^/F^ ratio displays 
a relatively large depletion at x ~ 0.6 and a rise above unity which does not set in until 
x ~ 0.8, significantly higher than typically found in smearing models. Unfortunately for 
light nuclei, and especially deuterium, it is difficult to define physically meaningful nuclear 
densities [37], so that the application of the density model to deuteron data inevitably suffers 
from ambiguities. Furthermore, even for heavy nuclei the authors of Ref. [36] caution that 
the density model should only be considered qualitative beyond x 0.6-0.7. 

C. Target mass corrections 

In the context of the operator product expansion (OPE) in QCD, deep inelastic scat- 
tering is formulated through moments of structure functions, which are related to forward 
matrix elements of local operators. The x dependence of structure functions is formally 
reconstructed from the moments via an inverse Mellin transform. At large Q 2 the process is 
dominated by matrix elements of operators of twist two, such as the quark bilinear ip^ifj. 
Operators which include insertions of covariant derivatives, vp^^D^ 1 ■ ■ ■ D^ip, do not alter 
the twist, but have matrix elements that enter as M 2 /Q 2 corrections to the 0(1) terms 
[38, 39]. These "target mass corrections" (TMCs), which are of purely kinematic origin, 
need to be removed from the empirical structure function data before information on the 
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twist-two PDFs can be extracted. 

One of the limitations of the OPE formulation of TMCs is the so-called "threshold prob- 
lem", in which the target mass corrected structure functions remain nonzero at x > 1. This 
arises from the difficulty in consistently incorporating the elastic threshold in moments of 
structure functions at finite Q 2 , resulting in nonuniformity of the Q 2 — > oo and n — > oo 
limits, where n is the rank of the moment. A number of attempts to address the unphysical 
x — > 1 behavior have been made [24, 40, 41], although none of the proposed approaches is 
free of additional assumptions or complications [42]. 

Alternatively, using the collinear factorization (CF) formalism [43, 44] one can avoid the 
threshold ambiguities from the outset by formulating TMCs directly in momentum space. 
Recently this formalism was applied [45] to deep inelastic structure functions at large x 
at next-to-leading order, carefully taking into account the elastic threshold to render the 
structure functions zero above x = 1. However, in the handbag approximation, without 
introducing a suitable jet function to account for the invariant mass of the final hadronic 
state, leading order structure functions can still be nonzero at x — 1 [45]. A simplified version 
of the CF formalism, which involves replacing the Bjorken variable x with the Nachtmann 
variable f = 2x/ (l + y/l + 4x 2 M 2 /qA , was proposed in Refs. [46, 47]. At leading order 
this "^-scaling" prescription coincides with the CF results. Beyond leading order, however, 
this overestimates the CF results by about 20-30% [45, 48]. 

In this analysis we consider both the traditional OPE approach and the collinear fac- 
torization formulations of TMCs for DIS reactions. For deuterium data we first apply the 
TMCs to the nucleon structure function, which is then convoluted through Eq. (2) to obtain 
the deuteron F$. Although there is some residual dependence on the TMC prescription 
adopted, we will see in Sec. IV below that the resulting PDFs are essentially independent of 
the choice once allowance is made for dynamical higher twist and other power corrections. 
For non-DIS data, such as Drell-Yan, W^-asymmetry and jet production in hadronic colli- 
sions, the target mass and nuclear corrections should play a negligible role because of the 
limited reach at large x and the typically large momentum scales involved, and are therefore 
neglected in our analysis. 
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D. Dynamical power corrections 



In the OPE framework, dynamical higher twist contributions to DIS structure functions 
are associated with matrix elements of operators involving multi-quark or quark and gluon 
fields, which give rise to l/Q 2 or higher order corrections to structure functions. As with the 
kinematic target mass corrections, these must be taken into account in analyses of data at 
low Q 2 and especially at large x. Because dynamical higher twists involve nonperturbative 
multiparton interactions, it is notoriously difficult to quantify their magnitude and shape 
from first principles. 

The usual approach in analyses whose main aim is the extraction of leading twist PDFs 
is either to parametrize the higher twist contributions by a phenomenological form and fit 
the parameters to the experimental data [49, 50], or to extract the Q 2 dependence by fitting 
it in individual bins in x [13, 17, 51-53]. Such an approach effectively includes contributions 
from multiparton correlations (the true higher twist contributions) along with other power 
corrections that are not yet part of the theoretical treatment of DIS at low Q 2 . These include 
0(1/Q 2 ) contributions such as jet mass corrections [45] and soft gluon resummation [54], 
as well as contributions which are of higher order in a s but whose logarithmic Q 2 behavior 
mimics terms oc l/Q 2 at low virtuality [53, 55]. For these reasons it is more appropriate to 
speak about residual power corrections rather than higher twist corrections. Nevertheless, 
we shall refer to all of these l/Q 2 suppressed effects as "higher twist" corrections in order 
to conform with the terminology frequently used in the literature, but keeping in mind their 
possibly disparate origins. 

We employ the commonly-used phenomenological form for the total structure function, 



where F 2 LT is the leading twist component including TMCs, and the coefficient of the l/Q 2 
term is parametrized (in units of GeV 2 ) as 



The parameter c\ reflects the overall scale of the higher twist corrections, while C2 controls the 
well-known rise of the coefficient C (x) at large x; the parameter C3 allows for the possibility 
of negative higher twists at smaller x. For deuterium targets we use the same higher twist 





C(x) = c 1 x C2 (l + c 3 x) . 



(5) 
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term C(x) for the bound proton and neutron F 2 structure functions. In so doing we neglect 
higher twist contributions coming from parton rescattering on the spectator nucleon [56] 
due to the small atomic number, which would slightly increase the deuteron's dynamical 
higher twist contribution compared to that of the proton. These assumptions may need to 
be relaxed in future analyses, see e.g. Refs. [53, 57]. 

We stress the importance of explicitly including both the TMCs and the higher twist 
corrections, which have very different physical origin and can have very different x depen- 
dence. Since the higher twists are fitted phenomenologically, one could in principle simu- 
late TMC effects with an empirical power corrections function with a sophisticated enough 
parametrization. We have considered several alternative forms for C(x), but were not able 
to find an efficient parametrization which embodies both the higher twist and TMC effects 
and allows a good fit. On the contrary, inclusion of TMCs in the LT structure function in 
Eq. (4) allows us to adopt a very economical 3-parameter functional form for C(x). 

III. DATA SETS AND FITTING 

We perform NLO global PDF fits to proton and deuteron data from inclusive deep in- 
elastic fixed-target scattering experiments at Jefferson Lab (JLab), SLAC, and the BCDMS 
and New Muon Collaborations (NMC) at CERN; ep collider data from HI and ZEUS; E605 
and E866 Drell-Yan data from pp and pd collisions; WMepton asymmetry data from CDF 
and DO; W asymmetry data from CDF; and jet and 7+jet cross sections from DO. The full 
list of data sets, together with a comparison to the data sets included in the most recent 
CTEQ6.1 fits [15], is provided in Table I. 

The standard cuts on DIS data in previous global analyses have excluded data with 
Q 2 < 4 GeV 2 and W 2 < 12.25 GeV 2 , which means that effectively the PDFs have only 
been constrained up to x ~ 0.7. As discussed in Sec. I, extending the coverage to larger x 
requires relaxing the Q 2 and W 2 cuts. To test the effects on the PDFs as the kinematic cuts 
are relaxed, we consider several cuts in addition to the standard one, which we denote as 
follows: 

• cutO: Q 2 > 4 GeV 2 , W 2 > 12.25 GeV 2 (standard) 

• cutl: Q 2 > 3 GeV 2 , W 2 > 8 GeV 2 
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TABLE I: Data sets and number of data points (total and deuterium) used in the global fits 
discussed in the text. The data sets with a check mark (yf) were used in the CTEQ6.1 fits [15]. 



total deuterium 





cutO cut3 


cutO cut 3 


CTEQ6.1 


Dm 
i / 1 . j 


TT ah 


[OBJ 


272 




136 






SLAC 


[591 


206 1147 


104 


582 






NMC 


[601 


324 464 


123 


189 


V 




BCDMS 


[611 


590 605 


251 


254 


V 




HI 


[621 
L J 


230 251 






V 




ZEUS 


[63] 


229 240 






V 


i/A DIS 


CCFR 


[64, 65] 






J 

V 


DY 


E605 


[66] 


119 










E866 


[67] 


375 


191 




W asymmetry CDF '98 (£) 


[68] 


11 










CDF '05 (£) 


[69] 


11 










DO '08 {£) 


[70] 


10 










DO '08 (e) 


[71] 


12 










CDF '09 (W) [72] 


13 








jet 


CDF 


[73] 


33 






V 




DO 


[74] 


90 






V 


7+jet 


DO 


[75] 


56 






TOTAL 


2408 3709 


569 


1161 





• cut2: Q 2 > 2 GeV 2 , W 2 > 4 GeV 2 

• cut3: Q 2 >m 2 c = 1.69 GeV 2 , W 2 > 3 GeV 2 . 

In Figure 1 we plot the (x, Q 2 ) coverage of the DIS data sets used in this analysis, along with 
the above kinematic cuts. The approximate doubling of the number of data points included 
in the fits going from cutO (iNfois = 1579) to cut3 (Afcis = 3079) is evident. In particular 
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FIG. 1: DIS data points and cuts for proton (left) and deuterium (right) targets, with the cutO- 
cut3 kinematic cuts drawn from top to bottom with solid lines. The data points are from JLab 
(open red squares), SLAC (open blue triangles), NMC (filled green triangles), BCDMS (crosses), 
HI (filled black circles), ZEUS (open black circles). 

we can include most of the SLAC data points, which reach the lowest Q 2 values, and about 
half of the recent JLab data points. At the lower Q 2 values, we include only those JLab data 
for which an estimate of the uncertainties related to parametrizations of the longitudinal to 
transverse cross section ratio is available. 

The methodology used in the global fits follows that of the CTEQ6 series of global fits [15]. 
In particular, the same functional forms for the PDF parametrizations at the initial scale 
Qo = 1.3 GeV are employed. Several of the parameters are fixed at representative values 
and a total of 20 PDF parameters are varied in the fits. An additional 3 free parameters 
are introduced for the parametrization of the higher twist corrections, and error bands are 
calculated using the Hessian technique. The Ax tolerance used to define the errors in global 
fits has been extensively discussed in the literature, and various groups use values ranging 
from A%=1 to 10. We quote the error bands with Ax=l, and simply use them to show the 
relative variation of the errors as the cuts are varied and between different PDFs. 
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IV. FIT RESULTS 



Having described the expanded data sets used in this study and the new ingredients 
necessary for fitting the data in the large- a; region, in this section we present the numerical 
results of our analysis. 

A. Reference fit and kinematic cuts 

The first step in this study is to define a reference fit in order to have a standard against 
which to compare the results of subsequent fits as new terms are added and new sets of cuts 
are imposed. This is achieved using the data listed in Table I with the cutO set of kinematic 
cuts. Target mass, higher twist and nuclear corrections (for deuterium data) are not included 
in the reference fit, which should therefore yield results similar to those of the CTEQ6.1 
PDFs [15]. Small differences can be expected, however, due to some additional data sets 
used. For example, the inclusion of the SLAC and E866 data induces a small reduction of 
the w-quark distribution at large x. The E866 data also lead to a sizable enhancement of 
the (i-quark distribution at large x, which is partially offset by the jy-asymmetry and 7+jet 
data. 

The results for the u and d distributions are shown in Fig. 2(a) as ratios to the cor- 
responding CTEQ6.1 PDFs. As anticipated, the u distribution is slightly reduced in the 
large-x region, while the d distribution shows a small increase for x > 0.3, leading to a mod- 
est increase of the d/u ratio at intermediate x. The dotted vertical line at x — 0.7 indicates 
the restriction on the range in x for which there are sufficient DIS data to constrain the 
PDFs with cutO kinematic limits. Accordingly, the portion of each curve extending beyond 
the dotted line represents an extrapolation of the fitted results. 

The gluon and sea quark PDFs are not discussed here because they are too poorly con- 
strained at large x by the chosen data sets. More specifically, the gluon distribution is 
partially constrained at intermediate and low x by the scaling violations in DIS and by the 
jet data. At larger x, however, it is only indirectly constrained by the momentum sum rule. 
This can be seen by considering the effects of a small change in one of the valence distri- 
butions, such as described above, which alters their average momentum. With the gluon 
already constrained somewhat in the intermediate- to low-x region, it is the large-x gluon 



14 



2 
1.5 
1 

0.5 
2 

1.5 
1 

0.5 



Z_ "ref/ ^CTEQ6.1 



1 — I I I I I I I — I I I I I I I I I I I 



u ref /u 



(a) 



II I ll III I 



d ref /d 



ret' ^CTEQ6.1 




Q 2 =10 GeV 2 



j i i I i i i I i i i I i i i I i i i_ 



2 
1.5 
1 

0.5 
2 

1.5 
1 

0.5 



J 1 II 1 II II II II 1 
L U / U ref 


1 1 1 1 1 1 

( b ) = 


: cutO+TMC+HT 

— cutO+nuc 

- cutO+nuc+TMC + Hl 

III II 1 I I 


- 

i i 1 i i i i" 


L d / d ref 


1 1 1 1 1 1 1 1 i 


r Q 2 =10 GeV 2 

"l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


1 1 1 1 1 1 1 1 1 



0.2 0.4 0.6 0.8 1 



0.2 0.4 0.6 0.8 
x 



1 



FIG. 2: (a) Impact of the expanded data set, with the standard cuts (cutO) and no target mass, 
higher twist or nuclear corrections applied. Illustrated are ratios of the PDFs from the resulting 
"reference" fit to those from the CTEQ6.1 fit. (b) Effects on the reference fit as TMC (using the 
CF prescription), higher twist (HT) and nuclear corrections (WBA) are added. 

which is modified in order to satisfy the momentum sum rule; since the gluon distribution is 
much smaller than the valence distributions at large x, this results in large variations of the 
high-x gluons in order to compensate the rather small changes in the valence PDFs. The 
large variations in the shape of the gluon PDF at large x then propagate to the sea PDFs 
via the actions of the evolution equations. Discussions on how to better experimentally 
constrain the gluon distributions at large x are contained in Refs. [13, 14]. 

Having verified that the earlier CTEQ6.1 results could be reproduced up to small varia- 
tions due to the different data sets used, the next step consists of a systematic investigation 
of the effects of including target mass, higher twist, and nuclear corrections, while also re- 
ducing the cuts on W and Q. It is instructive to first consider the impact of each of these 
corrections on the reference fit, which is illustrated in Fig. 2(b). One can see that the cutO 
constraints effectively limit any change to the reference fit u PDF. On the other hand, while 
the d PDF shows little sensitivity to the target mass and higher twist effects, it does show 
that the nuclear corrections to the deuterium data have a profound effect. The nuclear 
effects, as calculated in the framework of the weak binding approximation [22, 24, 25], cause 
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FIG. 3: (a) Effect on the d-quark distribution from varying the Q and W cuts, including target 
mass (CF), higher twist and nuclear corrections (using the WBA nuclear smearing model). The 
vertical dotted lines indicate the x region which is not directly constrained by DIS data (x > 0.7 for 
cutO and x > 0.8 for cut3). (b) Sensitivity of the (i-quark distribution to nuclear correction models 
with cut3 kinematics: no nuclear effects ("free", dot-dashed), nuclear density model (long-dashed), 
WBA nuclear smearing model without (solid) and with (short-dashed) off-shell corrections. 

a large reduction in the d PDF relative to the reference fit starting at x ~ 0.5. The les- 
son for PDF global fits is that, even with the cutO constraints on the data selection, one 
should include nuclear corrections for the deuterium data. The cutO constraints, however, 
effectively remove the need for the inclusion of the TMC and HT contributions, as was the 
original intent (which also confirms the analogous results of Ref. [33]). Of course, the PDFs 
are then directly constrained only for x < 0.7. 

With the reference fit as a starting point we next turn on the TMC, add the higher 
twist parametrization, include the deuteron nuclear correction, and then perform fits for 
the various sets of cuts denoted by cutO through cut3. Progressively loosening the cuts 
naturally brings more data into play. These data provide additional constraints at high 
values of x, and one can see in Fig. 3(a) that the reduction in the d PDF is less pronounced 
than with cutO. Thus, the extrapolation of the cutO results into the region beyond x ~ 0.7 
actually shows too strong a reduction. Remarkably, the lowering of the W and Q cuts does 
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not qualitatively affect the <i-quark suppression beyond x ps 0.5, which remains a stable 
feature for all fits considered. 

B. Sensitivity to nuclear corrections 

As suggested in Fig. 2(a) for cut3 kinematics, and illustrated in Fig. 3(b) for cut3, the 
behavior of the ci-quark distribution at large x is driven primarily by the nuclear effects. With 
no nuclear corrections (treating the deuteron as a sum of a free proton and free neutron), 
the d quark is strongly enhanced above x ~ 0.5. This occurs because the Fermi motion in 
the deuteron causes an enhancement of the data for F% beyond what would be expected by 
treating the deuteron as the sum of free nucleons; if no smearing corrections are included, 
the resulting (i-quark distribution must make up this enhancement as the u distribution is 
already well constrained at large values of x by the proton data. A qualitatively similar 
result is found using the nuclear density model [36], which arises directly from the delayed 
onset of Fermi motion in this model to x > 0.8, and the corresponding smaller deviation of 
the F2IF2 ratio from unity in this region. 

On the other hand, correcting for the nuclear effects in the F 2 d data using the WBA 
finite-Q 2 smearing model discussed in Sec. II B leads to a rf-quark distribution with the 
opposite trend at large x relative to the reference fit, as Fig. 3(b) illustrates. At x — 0.8 this 
amounts to a ~ 40% reduction of the d PDF relative to the reference fit. Not accounting for 
the nuclear smearing in deuterium data will thus lead to a significant overestimate of the d 
distribution at x > 0.6. This will be the case in fact for a wide range of nuclear smearing 
models, and regardless of the details of the deuteron wave function. 

Because the nucleons in the deuteron are slightly off-shell, with \p 2 — M 2 \/M 2 = O (few %), 
in principle this can affect the extracted neutron structure function and resulting rf-quark 
distribution. Using the model for off-shell corrections discussed in Sec. II B does indeed result 
in an increase in the d distribution at large x compared with the on-shell calculation, which 
reflects the larger deviation from unity of the F^j ratio in the presence of off-shell effects 
[22, 24, 30, 31, 76]. A simple way of understanding this is to consider the struck nucleon as 
having a slightly smaller mass than a free nucleon, p 2 = M* 2 < M 2 . In the target rest frame 
the effective scaling variable is then x* = Q 2 /2M*u, so the actual value of x is larger than 
would be expected for a free nucleon. This leads to the observed structure function being 
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slightly reduced in this region, as can be seen from Eq. (3) where the reduction is « —1.5%. 

To quantify the impact of the off-shell corrections on the ci-quark distribution one may 
consider the leading order structure functions in the large- £ region, which is dominated by 
the valence PDFs, 

F -5*"( 1 + £) and = + < 6 > 

and examine the effect of a small variation in F 2 d caused by correcting for off-shell effects. 
Since there is no variation in as an approximation the proton structure function can 
be assumed to remain the same. The modifications to the u- and ci-quark PDFs are then 
related by 5u = —\8d, and using Eq. (6) gives 

d 3 F{ \ i) ' 1 ' 

A small change in F 2 d thus results in a relative shift in the ci-quark distribution that is 
enhanced by a factor of order u/d. For example, at x — 0.8, the ratio u/d 20, and a shift 
of 1.5% in F 2 d leads to a increase of the d distribution of about 40%. This is similar to what 
is observed in Fig. 3(b). 

The same argument also explains the large sensitivity of the large- a; ci-quark PDF to the 
choice of the nuclear correction model. Currently this represents the single most important 
theoretical uncertainty on large- £ PDFs, which can affect not only the extraction of the 
d/u ratio at large x (see Sec. V), but also the determination of Tevatron and LHC parton 
luminosities at large produced mass [12]. 



C. Sensitivity to TMC and HT corrections 

The sensitivity of the ci-quark distribution to the finite-Q 2 corrections associated with 
target mass and higher twist effects is illustrated in Fig. 4, where three different TMC pre- 
scriptions are illustrated: the OPE-based prescription [39, 42], the Nachtmann ^-scaling 
[46, 47], and the collinear factorization prescription [45], along with a fit with no TMCs. 
Although there is some dependence on the choice of TMCs, with the addition of the phe- 
nomenological higher twist correction (4) the resulting fits in Fig. 4(a) are almost identical, 
with x 2 /d.o.f. ~ 1. This reveals an important interplay between the target mass and higher 
twist corrections, which tend to compensate each other in the fitting procedure. 
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FIG. 4: (a) Effects of target mass and higher twist corrections on the d-quark distribution, using 
the ^-scaling (short-dashed), OPE (long-dashed) and CF (solid) TMC prescriptions, as well as no 
TMC (dot-dashed). In each case a phenomenological higher twist correction is included in the fit, 
and the x 2 /d.o.f. is indicated, (b) Extracted higher twist coefficient C(x) for the different TMC 
prescriptions, compared with the higher twist determination from the MRST fit [52]. 

As discussed in Sec. II, given a sophisticated enough parametrization of the higher twist 
corrections, one could in principle accommodate the entire TMC effect, even though the 
existence of TMCs is well-established theoretically. In practice, however, it is preferable 
to include the TMC and HT terms separately because of the different models in use for 
the TMCs, and the separate HT parametrization is able to compensate the different model 
choices. Furthermore, it allows us to use a very economical functional form for the higher 
twist corrections with only 3 parameters. In contrast, with no TMCs we found no suitable 
parametrization able to produce a satisfactory fit with a similar x 2 - 

Given that there is very little difference between the (i-quark distribution obtained using 
the different TMC choices, it is clear that one cannot determine which is the preferred TMC 
prescription from these fits alone. On the other hand, the gluon distribution at large x 
obtained by analyzing the scaling violations of F2 does depend somewhat on the chosen 
TMC prescription because of the different dependence on Q 2 . One can therefore include 
longitudinal structure function or cross section data in the fits to directly constrain the 
gluon distribution. A comparison of the fitted gluon distribution with the scaling violations 
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of F 2 may then allow the TMC models to be distinguished. 

The interplay between TMCs and higher twists is more vividly demonstrated in Fig. 4(b), 
where the higher twist coefficient C(x) is plotted for the various TMC prescriptions. The 
characteristic rise with increasing x is evident, with the higher twist term being larger for 
the CF prescription than for the others. The extracted higher twist coefficient for the case 
with no TMCs is also qualitatively similar to that obtained in the MRST analysis [52] , which 
did not include target mass or deuterium corrections. The negative higher twist term at 
x ~ 0.3 evident with the OPE and no TMC prescriptions disappears when using the CF or 
^-scaling approaches. 

The extracted HT coefficient for the OPE prescription is about half of that obtained 
by Bliimlein et al. (BBG) [33, 53] in fixed- £ bins. This may be due in part to our HT 
parametrization being too restrictive. On the other hand, the BBG parton distributions 
have been fitted with cutO kinematic cuts and then extrapolated to smaller x and Q 2 before 
extracting the HT terms. This extrapolation may underestimate the d quark distribution 
at large x, similarly to our result in Figure 3(a), therefore leading to an overestimate of the 
HT terms. 

We emphasize, however, that our main finding here is not the magnitude of the higher 
twist correction per se, but the fact that very stable fits to leading twist PDFs for the u 
and d quarks can be obtained by the inclusion of both the target mass and higher twist 
corrections. 

D. Final fit results 

We conclude this section with the presentation in Fig. 5 of the final fit results for the cut3 
set of constraints, which we refer to as "CTEQ6X" . One can see the significant reduction in 
the d PDF at large values of x that was discussed previously, with the vertical lines again 
indicating the approximate limits of the fit constraints. As shown in Fig. 1, the proton 
D1S data extend to x — 0.9, and consequently the u distribution is well constrained up 
to that limit. The d distribution, however, is constrained only up to x = 0.8, reflecting 
the maximum extent of the deuterium data. The error bands on the PDFs correspond to 
Ax = 1 and include systematic and statistical experimental uncertainties, but exclude the 
theoretical uncertainties discussed above. 
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FIG. 5: Results of the CTEQ6X fit with expanded kinematics (cut3) and inclusion of TMC, HT and 
nuclear corrections, normalized to the CTEQ6.1 PDFs. The vertical lines show the approximate 
values of x above which PDFs are not directly constrained by data. The error bands correspond 
to A X = 1. 

One of the goals of this investigation was to examine the extent to which the errors on 
the PDFs would be affected by the addition of the new data made accessible by relaxing the 
selection criteria applied to the DIS data. This is shown in Fig. 6 where the relative errors on 
the u and d PDFs are shown, normalized to the relative errors from the reference fit. These 
ratios are shown for the cutO through cut3 kinematic cuts. For cutO and cutl the errors 
typically increase because the amount of new DIS data is not sufficient to compensate for the 
added 3 free parameters that describe the phenomenological higher twist contributions. For 
cut2 and cut3 there is a substantial reduction in the uncertainty of these PDFs due to the 
increased data, with the cut3 errors reduced by 10-20% for x < 0.6, and by up to 40-60% 
at larger x. However, it must be stressed that these errors only reflect the uncertainties due 
to propagating the experimental errors. In particular, uncertainties associated with nuclear 
corrections are not shown; these will be discussed in more detail in a subsequent analysis 
[77]. 

As a consistency check of the new fits, in Fig. 7 we show the ratio F^jlF^ computed from 
the new PDFs and compare them to data at Q 2 = 6, 12 and 20 GeV 2 [78] using several models 
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FIG. 6: Relative PDF errors for u and d quarks normalized to the relative errors in the reference 
fit. 

for the nuclear corrections, as in Fig. 3(b). The fit with no nuclear corrections is clearly 
disfavored, while all three nuclear correction model curves show a \ 2 which is improved 
to some degree. The nuclear density model correction has difficulty in reproducing the Q 2 
dependence of the data, especially at high x. The WBA nuclear smearing model, on the 
other hand, is strongly Q 2 dependent at large x and rises more steeply than the density 
model as x — > 1 when Q 2 > 10 GeV 2 . The addition of off-shell corrections has very little 
effect on the fit: essentially, the <i-quark PDF shifts to compensate the inclusion of the off- 
shell corrections and the PDFs obtained with these two models provide an estimate of the 
uncertainty due to the model dependence of the nuclear corrections. A detailed comparison 
to data over a wider range of Q 2 would help to clarify the nature of the Q 2 dependence of 
the nuclear corrections, as also emphasized recently by Arrington et al. [78]. 

V. d/u RATIO 

According to spin-flavor SU(6) symmetry with no flavor- dependent interactions between 
quarks, one would expect d/u = 1/2 for all x. Empirically the d/u ratio of course deviates 
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FIG. 7: Ratio of deuteron to proton Fi structure functions computed from the cut3 PDFs at 
Q 2 = 6, 12 and 20 GeV 2 for various nuclear correction models, as in Fig. 3. Data points are taken 
from the compilation of SLAC, BCDMS and NMC data in Ref. [78], with W 2 < 3 GeV 2 data 
denoted by open circles. 

strongly from this naive expectation, and its x — > 1 behavior is a particularly sensitive 
indicator of the dynamics responsible for the symmetry breaking. If the interaction between 
the two valence quarks that are spectators to the hard collision is mediated by a spin- 
dependent color-magnetic force, such as from single gluon exchange (which also accounts for 
the mass splitting between the nucleon and A [79]), the two-quark state with spin will be 
energetically favored relative to that with spin 1. A dominant scalar "diquark" component 
of the proton would then lead to a suppression of the d quark distribution and a vanishing 
d/u ratio as x — > 1 [80]. On the other hand, if the dominant scattering process involves 
scattering from quarks with the same helicity as that of the proton, as would be expected 
from perturbative QCD, then this mechanism of SU(6) breaking would lead to d/u — > 1/5 
in the x — > 1 limit [81]. The approach of the PDFs to the x = 1 limiting values may also 
reveal the role played by quark orbital angular momentum in the nucleon [82]. 

In this section we explore in more details what the new fits can tell us about the large- £ 
behavior of the d/u ratio. We discuss the sensitivity of our d/u fits to nuclear corrections, 
and review future experiments which can more directly constrain the d distribution as x — > 1. 
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FIG. 8: (a) d/u quark distribution ratio for the reference, CTEQ6.1 and CTEQ6X (cut3) fits, 
(b) Ratio of the reference and CTEQ6X (cut3) to the CTEQ6.1 d/u ratios. In both panels the 
dotted lines indicate the region where the PDFs are not directly constrained by DIS data. 

A. Effect of nuclear corrections on d/u 

The ratio of the d to u distributions is shown in Fig. 8 for the CTEQ6.1, reference, and 
CTEQ6X fits, together with predictions from various models for the limiting behavior as 
x — y 1. The rapid fall-off at high values of x for the CTEQ6X PDFs (with cut3) suggests 
that our results favor a lower d/u value that is more consistent with the model of scalar 
diquark dominance of the proton wave function than with models that predict large d/u 
asymptotic values. In fact, relative to the standard PDF fits which assume no nuclear 
corrections in the deuteron, the extracted d/u ratio is reduced at x > 0.6, as evident already 
from Fig. 3. 

We should note, however, that since our fitting region is restricted to x < 0.8 for the d- 
quark distribution, the fits can only extrapolate the behavior of the d/u ratio at 0.8 < x < 1. 
Furthermore, since similar parametric forms are used for both the u and d distributions, the 
d/u ratio is constrained to approach or infinity at the point x = 1. Nevertheless, in the 
absence of a dramatically large upturn in d/u beyond x ~ 0.8, the fits in the constrained 
region suggest a trend of the ratio towards a small limiting value. 

This result is qualitatively different from the fit of Yang and Bodek [34] , in which the ratio 
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of deuteron to proton and neutron F 2 structure functions was fitted with d/u constrained 
to approach the limit 1/5 at x — 1 [81]. However, the neutron structure function in that 
analysis was extracted assuming the nuclear density model [36] for the nuclear corrections 
in the deuteron. As discussed in Sec. II B, the extrapolation of the density model to the 
deuteron suffers from ambiguities associated with defining physically meaningful nuclear 
densities for the deuteron [37], and even for large nuclei the model should not be considered 
quantitative at very large x (x > 0.6-0.7) [36]. 

In contrast to the nuclear density model, where the nuclear correction is assumed to 
be independent of the bound nucleon structure, the size of the nuclear correction within 
the nuclear smearing approach depends also on the shape of the bound nucleon structure 
function through the convolution in Eq. (2). Functions with harder x distributions produce 
ratios F^/F^ which have greater depletion at intermediate and large x, and a delayed onset 
of the Fermi motion rise as x — > 1. This was observed in the quark model calculation of 
Ref. [30], which also illustrates the importance of the Q 2 dependence of the data at large x. 
Since the determination of the model parameters in [30] involved fitting to the large- re proton 
and deuteron data without including target mass and higher twist corrections, the resulting 
distributions were generally harder than the empirical PDFs, leading to a ratio F^/F^ in 
which the Fermi motion rise does not appear until x > 0.8. Incorporating the finite-Q 2 
corrections in the fits would result in softer distributions and consequently an earlier rise of 
F^/F^ due to Fermi motion at x ~ 0.6. 

It is clear that obtaining a precise behavior of d/u as x — > 1 requires a similarly precise 
knowledge of the nuclear corrections associated with the use of a deuterium target. While 
the results will inevitably have some dependence on the nuclear model adopted, our finding 
of a reduced ci-quark distribution will prevail in a wide range of models which incorporate 
the standard nuclear corrections such as Fermi motion, binding and nucleon off-shell effects. 
As seen in Fig. 7, inclusive DIS data on a deuterium target alone have little discriminatory 
power, except if the Q 2 range and precision of the measurements are increased. A more 
detailed analysis of the correlation between nuclear corrections and the d/u ratio at x — > 1, 
as well as of the dependence of d/u on the parametric forms, will be presented in a future 
publication [77]. 
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B. Direct experimental constraints on d/u 

To avoid nuclear corrections altogether, one would ideally need data on free nucleon 
targets which would constrain the <i-quark PDF. This suggests weak interaction processes 
where one can utilize variations of the flavor changing transition W + d — > u. An example 
would be the DIS processes vp — > [T X and Dp — > fi + X, which could probe the d and u 
PDFs, respectively, at large values of x. However, existing data sets for neutrino scattering 
from hydrogen do not have sufficient statistics to provide information in the x — > 1 region. 
A new high-statistics experiment on hydrogen, as is being considered by the MINERvA 
collaboration [83], would be required for such a study. 

A further possibility is provided by the W-lepton or W charge asymmetries measured at 
the Tevatron in pp collisions. These are sensitive to the d/u ratio, and data at large values 
of the lepton or W rapidity can reach x values as high as 0.8. However, this is at the edge 
of the kinematic coverage and the data there are statistically limited. Nevertheless, new 
data sets with increased statistics may help provide additional constraints on d/u which are 
independent of nuclear corrections. 

Another method which utilizes the weak interactions to probe the d quark involves parity- 
violating electron DIS on a hydrogen target [84, 85]. Here the asymmetry between left- and 
right-hand polarized electrons selects the interference between the 7 and Z-boson exchange, 
which depends on the d/u ratio weighted by electroweak charges. Such an experiment is 
planned at Jefferson Lab, taking advantage of the high luminosity and energy available 
following the 12 GeV energy upgrade, and with the expected 1% measurements of the 
asymmetry it would strongly constrain the d/u ratio up to x ~ 0.8 [84]. 

While still relying on nuclear targets, a novel idea proposes the use of mirror symmetric 
nuclei, such as 3 He and 3 H, in which the nuclear effects mostly cancel. Explicit calculations 
have confirmed that accurate extraction of F^/ i*f is possible from measured ratios of 3 He 
to 3 H structure functions, with nuclear corrections canceling to within ~ 1% up to x pa 0.85 
[86]. 

Finally, an experimental program is under way to determine the F 2 ™ structure function 
from measurements of DIS on a deuterium target with low-momentum spectator protons in 
the backward center-of-mass hemisphere, which tags DIS on an almost free neutron in the 
deuteron [87]. Preliminary results have confirmed the feasibility of this method, and future 
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measurements of spectator protons at the 12 GeV energy updated Jefferson Lab, together 
with 3 He or 3 H spectators on 4 He targets, will be helpful both in determining the d quark 
at large x and in constraining the nuclear correction models. 

VI. CONCLUSION 

In this analysis we have explored the possibility of extending the range of fitted DIS data 
in both W and Q to lower values than have been traditionally used in global PDF fits, in 
order to obtain an extension of the covered range of x. The results presented here show that 
excellent fits can result from such a procedure provided that target mass corrections, higher 
twist contributions, and nuclear corrections for deuterium targets are all taken into account. 
The resulting fits show that the leading twist u- and ci-quark PDFs are stable with respect 
to the choices made for implementing the target mass corrections, as long as a flexible higher 
twist parametrization is employed. 

The major new feature of our fits compared to previous global analyses is the stronger 
suppression of the ci-quark distribution at large x. However, the precise amount of suppres- 
sion is sensitive to the treatment of the nuclear corrections, which are an important source 
of theoretical uncertainty at x ~ 1. Hence, further progress in the determination of the 
behavior of the large- a; PDFs and the d/u ratio requires either a better understanding of 
the nuclear corrections or the use of data obtained using free nucleons in the initial state, 
for which we reviewed several experimental possibilities. 
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